Correction of sequence-dependent ambiguous bases (Ns) from the 454 pyrosequencing system

Authors

  • Sunguk Shin
  • Joonhong Park
Abstract

Pyrosequencing of the 16S ribosomal RNA gene (16S) has become one of the most popular methods for assessing microbial diversity. Pyrosequencing reads containing ambiguous bases (Ns) are generally discarded, on the assumptions that Ns do not form in a sequence-dependent manner and that such reads have high error rates. However, removing reads with Ns changed the observed taxonomic composition. We therefore determined whether Ns from pyrosequencing occur in a sequence-dependent manner. Our reads and the corresponding flow value data revealed sequence-specific N errors with a common sequential pattern (a homopolymer + a few nucleotides with bases other than the homopolymer + N) and showed that the nucleotide base of the homopolymer is the true base for the following N. Using an algorithm reflecting this sequence-dependent pattern, we corrected the Ns in the 16S (86.54%), bphD (81.37%) and nifH (81.55%) amplicon reads from a mock community with high precisions of 95.4, 96.9 and 100%, respectively. The new N correction method resolved most of the Ns in amplicon reads from a soil sample, which reduced the taxonomic biases associated with N errors, and was also applicable to shotgun sequencing reads from public metagenome data. The method improves the accuracy and precision of microbial community analysis and genome sequencing using 454 pyrosequencing.
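The pattern described in the abstract (a homopolymer, then a few bases that differ from the homopolymer base, then N) can be illustrated with a short sketch. This is not the authors' algorithm: the function name `correct_ns` and the thresholds `min_run` and `max_gap` are hypothetical, the abstract does not state the actual cutoffs, and the sketch ignores the flow value data that the study also used.

```python
import re


def correct_ns(read, min_run=2, max_gap=3):
    """Replace an N with the base of the homopolymer that precedes it.

    Assumed, illustrative thresholds (not taken from the paper):
      min_run -- minimum homopolymer length, in bases
      max_gap -- maximum number of non-homopolymer bases between
                 the homopolymer and the N
    """
    pattern = re.compile(
        # group 1: the homopolymer base, repeated to at least min_run bases
        r"([ACGT])\1{%d,}"
        # one to max_gap bases, none equal to the homopolymer base
        r"(?:(?!\1)[ACGT]){1,%d}"
        # the ambiguous base to correct
        r"N" % (min_run - 1, max_gap)
    )

    def fix(match):
        # Keep everything except the trailing N, then append the homopolymer base.
        return match.group(0)[:-1] + match.group(1)

    return pattern.sub(fix, read)


if __name__ == "__main__":
    print(correct_ns("ACGGGGTAN"))  # N follows GGGG + TA
    print(correct_ns("ACGTN"))      # no homopolymer upstream, left unchanged
```

Reads whose Ns do not fit the pattern are returned unchanged, so a caller can still decide to discard them, as is done conventionally.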

Related resources

Evaluating the potential of 18S rDNA clone libraries to complement pyrosequencing data of marine protists with near full-length sequence information

Sequencing of 18S rDNA clone libraries and 454-pyrosequencing are valuable methods used to describe microbial diversity. The massively parallel 454-pyrosequencing generates vast amounts of ribosomal sequence data and has the potential to uncover more organisms, even rare species. However, the relatively short sequence lengths of ∼500 bp are suboptimal for taxonomic annotation and phylogenetic a...

Quality Score Based Identification and Correction of Pyrosequencing Errors

Massively-parallel DNA sequencing using the 454/pyrosequencing platform allows in-depth probing of diverse sequence populations, such as within an HIV-1 infected individual. Analysis of this sequence data, however, remains challenging due to the shorter read lengths relative to that obtained by Sanger sequencing as well as errors introduced during DNA template amplification and during pyroseque...

Indel and Carryforward Correction (ICC): a new analysis approach for processing 454 pyrosequencing data

MOTIVATION Pyrosequencing technology provides an important new approach to more extensively characterize diverse sequence populations and detect low frequency variants. However, the promise of this technology has been difficult to realize, as careful correction of sequencing errors is crucial to distinguish rare variants (∼1%) in an infected host with high sensitivity and specificity. RESULTS...

Lessons learned from microsatellite development for nonmodel organisms using 454 pyrosequencing.

Microsatellites, also known as simple sequence repeats (SSRs), are among the most commonly used marker types in evolutionary and ecological studies. Next Generation Sequencing techniques such as 454 pyrosequencing allow the rapid development of microsatellite markers in nonmodel organisms. 454 pyrosequencing is a straightforward approach to develop a high number of microsatellite markers. There...

Correction: Spatial Variation of the Gut Microbiota in Broiler Chickens as Affected by Dietary Available Phosphorus and Assessed by T-RFLP Analysis and 454 Pyrosequencing

The second author’s name is misspelled. The correct name is: Amelia Camarinha-Silva. The correction citation is: Witzig M, Camarinha-Silva A, Green-Engert R, Hoelzle K, Zeller E, Seifert J, et al. (2015) Spatial Variation of the Gut Microbiota in Broiler Chickens as Affected by Dietary Available Phosphorus and Assessed by T-RFLP Analysis and 454 Pyrosequencing. PLoS ONE 10(11): e0143442. 10.1371...

Journal title:

Volume 42, Issue

Pages -

Publication date: 2014